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Abstract: Short-term climate reconstruction, i.e., the reproduction of short-term (several decades) historical 
climatic time series based on the relationship between observed data and available longer-term reference 
data in a certain area, can extend the length of climatic time series and offset the shortage of observations. 
This can be used to assess regional climate change over a much longer time scale. Based on monthly grid 
climate data from a Coupled Model Inter-comparison Project phase 5 (CMIP5) dataset for the period of 
1850-2000, the Climatic Research Unit (CRU) dataset for the period of 1901—2000 and the observed data 
from 53 meteorological stations located in the Tianshan Mountains region (TMR) of China during the 
period of 1961—2011, we calibrated and validated monthly average temperature (MAT) and monthly 
accumulated precipitation (MAP) in the TMR using the delta, physical scaling (SP) and artificial neural 
network (ANN) methods. Performance and uncertainty during the calibration (1971-1999) and verification 
(1961-1970) periods were assessed and compared using traditional performance indices and a revised set 
pair analysis (RSPA) method. The calibration and verification processes were subjected to various sources 
of uncertainty due to the influence of different reconstructed variables, different data sources, and/or 
different methods used. According to traditional performance indices, both the CRU and CMIP5 datasets 
resulted in satisfactory calibrated and verified MAT time series at 53 meteorological stations and MAP time 
series at 20 meteorological stations using the delta and SP methods for the period of 1961—1999. However, 
the results differed from those obtained by the RSPA method. This showed that the CRU dataset produced 
a low degree of uncertainty (positive connection degree) during the calibration and verification of MAT 
using the delta and SP methods compared to the CMIP5 dataset. Overall, the calibrated and verified MAP 
had a high degree of uncertainty (negative connection degree) regardless of the dataset or reconstruction 
method used. Therefore, the reconstructed time series of MAT for the period of 1850 (or 1901)--1960 
based on the CRU and CMIP5 datasets using the delta and SP methods could be used for further study. The 
results of this study will be useful for short-term (several decades) regional climate reconstruction and 
longer-term (100 a or more) assessments of regional climate change. 
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1 Introduction 


Understanding past climates is critical for assessing the context of the natural climate and its 
consistency (Wu et al., 2010). The long-term continual observation of climate is a precondition for 
studying climate change (Misra et al., 2012; Ghosh, 2018). Both globally and in the Northern 
Hemisphere, the 20" century is considered the warmest century during the last 1000 a, but 
uncertainty still remains regarding the regional assessment of warming during the whole 20" 
century in western China, including northwestern China and the Tibetan Plateau, where recorded 
climatic data were rarely available prior to the 1950s (Fang et al., 2012; Li et al., 2013; Shen et al., 
2019). The climate in northwestern China changed abruptly in the mid-1980s, with a shift from 
warm-dry to warm-wet conditions (Shi et al., 2007). However, it is not known whether climate 
change in northwestern China occurs at decadal or centennial time scales (Shi et al., 2007; Li et al., 
2013). Furthermore, it is difficult to quantify and assess climate change over long historical periods 
and its impact on water resources, ecosystems, agriculture and other activities due to the shortage 
of instrumental records (Misra et al., 2012; Li et al., 2013; Yao et al., 2013; Shen et al., 2019). 

In recent years, many studies have attempted to reconstruct local paleoclimate variability by 
applying climate proxy data, including fossil pollens, ice cores, lake sediment cores, tree rings and 
historical documents (Bradley and Jones, 1992; Shen et al., 2001; Esper et al., 2002; Yang et al., 
2002; Fang et al., 2012; Wang et al., 2017; Klippel et al., 2018). This kind of reconstruction could 
provide long-term historical climatic time series. However, the process of paleoclimate 
reconstruction requires much time and effort. In addition, the uncertainty of such reconstructions is 
very high and the temporal resolution is low. Available comparatively long-term grid data, such as 
the Coupled Model Inter-comparison Project phase 5 (CMIP5) and Climate Research Unit (CRU) 
datasets, can provide historical climate data for historic periods extending over 100 a ago (Taylor et 
al., 2012; University of East Anglia Climatic Research Unit et al., 2017). These data offer an 
opportunity to back cast the historical local climate data in regions with a lack of long-term 
historical data (Li et al., 2013; Shen et al., 2019). 

Single-value indices, such as the Nash-Sutcliffe coefficient (NSC) and percentage bias (PBIAS), 
are quantitatively used to assess the performance of hydrological models (e.g., the SDHydro 
model) by assessing the quantitative relationship between the observed and estimated runoff 
(Moriasi et al., 2007; Chen et al., 2012; Zhang et al., 2016a, b). They are also used to evaluate the 
capacity of wind erosion models, such as the Agricultural Policy Environmental eXtender (APEX) 
model (Wang and Li, 2012; Pi et al., 2017) and some environmental models (e.g., Hauduc et al., 
2015). Chen et al. (2012) extended these single-value indices to the assessment of statistical 
downscaling methods. Li et al. (2013) analyzed the performance of reconstructions using these 
indices to test the quantitative relationship between the observed and reconstructed (calibrated and 
verified) hydro-meteorological time series. However, these indices cannot completely assess the 
qualitative and quantitative uncertainty associated with climate reconstruction (Li et al., 2013). 
There is a need to determine how best to qualitatively and quantitatively assess the reconstruction 
performance and uncertainty associated with different data sources and reconstruction methods. 

Based on the principles of set pair analysis (SPA) (Zhao, 1989; Zhao and Xuan, 1996), we 
developed a new method, namely, the revised SPA (RSPA), in this study. Then, we used this 
method to assess the performance and uncertainty of a short-term climate reconstruction in the 
Tianshan Mountains region (TMR) of China on the basis of two data sources and three 
reconstruction methods. This study will be useful and significant for short-term (several decades) 
regional climate reconstruction and longer-term (100 a or more) assessments of regional climate 
change. 
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2 Study area and data 


2.1 Study area 


Located in Central Asia and northwestern China, the Tianshan Mountains stretch approximately 
2500 km in total, with about 1700 km in China. The area within China, about 5.7x10° km?, 
accounts for 34.5% of the total area of Xinjiang Uygur Autonomous Region (Hu, 2004) and is 
referred to as the TMR in this study (Fig. 1). The TMR starts from the Tarim Basin in the south and 
stretches to the Jungar Basin in the north. It forms the climate watershed between the northern and 
southern parts of Xinjiang (Wei et al., 2008). The average height of the mountain ridge is 4000 m 
a.s.l., with the highest peak being Tomor (7435 m a.s.1.). The TMR is a typical alpine mountainous 
area that is characterized by a continental climate, with large seasonal differences. Precipitation, 
which is mainly affected by the westerly circulation and topography, is generally regarded as the 
main source of water resources in Xinjiang (Liu et al., 2015; Zhang et al., 2019). 

At the spatial scale, meteorological stations with long-term time series are unevenly distributed 
in the TMR (Fig. 1). At the temporal scale, meteorological records began in the late 1950s and no 
data are available prior to this time. The existing recorded data indicate that the annual average 
temperature is 7.7°C and is higher than 0.0°C for most meteorological stations, except those with 
elevations above 2000 m a.s.l. (e.g., Bayanbulak, Daxigou and Tuergate stations). The average 
annual precipitation is 189.58 mm and is larger at stations with higher elevations, such as 
Bayanbulak, Daxigou, Tianchi, Tuergate and Zhaosu stations (Li et al., 2016). 
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Fig. 1 Distribution of the meteorological stations in the Tianshan Mountains region (TMR) of China. The full 
name and descriptive information of all meteorological stations can be seen in Table S1 in the Appendix. 


2.2 Data 


Observed monthly data and two sources of grid monthly data were used to reconstruct historical 
climatic time series in the TMR: (1) observed data from meteorological stations in the TMR; (2) 
grid data from the CRU dataset; and (3) a CMIP5 dataset of past and current climatic conditions. 
The monthly average temperature (MAT) and monthly accumulated precipitation (MAP) at 62 
meteorological stations in the TMR were the target climate records for calibration and verification. 
The CRU TS v4.03 dataset covers the period of 1901-2018. Time series datasets for the 20% 
century indicate the month-by-month variation in climate at 0.5° spatial resolution. The variables 
include MAT, MAP, diurnal temperature range, monthly average daily maximum temperature and 
cloud cover (Harris et al., 2014; University of East Anglia Climatic Research Unit et al., 2020). 

The CMIP5 multi-model dataset was compiled by the Working Group on Coupled Modelling 
(WGCM) and Program for Climate Model Diagnosis and Intercomparison (PCMDI) (Taylor et al., 
2012). This multi-model dataset included the outputs from 42 global climate models (GCMs) (for 
details see Table S2 in the Appendix). Among these GCMs, 42 provided the MAP and 41 provided 
the MAT. Therefore, the MAP from 42 GCMs and MAT from 41 GCMs were used in this study. 
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These monthly data were subjected to a guality control procedure, which involved a visual 
examination of individual stations or grid records to identify outliers. Abnormal values were 
identified and either removed or corrected. Only a small fraction of the data needed correction. 
Missing monthly data, accounting for less than 2% of the total data, were estimated for a given 
month by extrapolating the average value of the data from the one or two preceding and one or two 
following records. Any time series with over one year of missing data was omitted from the 
research. Finally, the observed data from 53 meteorological stations in the TMR were used, while 
data from nine stations were omitted. The details of the 53 stations are provided in Table S1 in the 
Appendix. The near gridded values of MAT and MAP data from the CRU and CMIP5 datasets 
were interpolated into the 53 selected meteorological stations in the TMR by the bi-linear 
interpolation method (Huang et al., 2018). As with the observed data, a similar guality control 
procedure was applied to the data from the CRU and CMIP5 datasets. Finally, the interpolated 
MAT data from 41 GCMs in the CMIP5 and CRU datasets and the interpolated MAP data from 42 
GCMs in the CMIP5 and CRU datasets were used in this study. 

Considering the length of the observed data in the CMIP5 and CRU datasets, the period of 
1961-1999 could be used as the calibration and verification period. Hence, each time series was 
divided into two parts for both MAT and MAP in accordance with the two stages in the 
model-building process. The first stage (1971-1999: 348 months) was used for calibration and the 
second stage (1961-1970: 120 months) was used for verification using the delta, physical scaling 
(SP) and artificial neural network (ANN) methods, with the bi-linear interpolated CMIP5 and CRU 
data. During the process of calibration and verification, only the interpolated CMIPS and CRU data 
obtained by the bi-linear interpolation method were used to represent local station data and this 
approach was referred to as "No method". Therefore, during each stage, 168 calibrated and verified 
MAT time series and 172 calibrated and verified MAP time series were obtained for each station. 


3 Methods 


Many methods and approaches were considered for use in this study. Figure 2 shows a flow chart 
of the methods and approaches that were considered. 


Interpolated Observed MAT/MAP 
MAT/MAP data || from meteorological 
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CMIP5 datasets ||TMR 
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Interpolated Observed MAT/MAP 
MAT/MAP data | |from meteorological 
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Delta ANN 
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Performance comparison by 
traditional assessment 
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Uncertainty analysis 
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Fig. 2 Flow chart of methods and approaches used in this study. MAT, monthly average temperature; MAP, 
monthly accumulated precipitation; CRU, Climate Research Unit; CMIP5, Coupled Model Inter-comparison Project 
phase 5; TMR, Tianshan Mountains region; SP, physical scaling; ANN, artificial neural network; RSPA, revised set 
pair analysis. 


JOURNAL OF ARID LAND 


3.1 Delta method 

The delta method is a simple statistical downscaling approach that has been widely applied in 
previous assessments of climate change and water planning studies (Hay et al., 2000; Zhao and Xu, 
2007; Hao et al., 2009; Li et al., 2013). The method has been described in previous literature (Li et 
al., 2013). In this study, the delta method was extended to use in climate reconstruction as follows: 


T(x,t) =Tb(x,0) + (Tr(y,t) -Tbr(y,0)), (1) 
=. (PD 
P(x,t) = Pr(y,t) Oy" (2) 


where t denotes the number of the month in the year, t=1, 2, 3, ..., 12; T(x, t) (°C) and P(x, t) (mm) 
are the reconstructed MAT and MAP for the target location x in the t month during the 
reconstruction period, respectively; Tr(y, t) and Pr(y, t) represent gridded values of MAT and MAP 


for the reference location y in the t month during the reconstruction period, respectively; Tb(x,t) 


and Tbr(y,t) (*C) are the mean values of MAT during the baseline time period for the target 


location x and the reference location y in the tè month, respectively; and Pb(x,t) and Pbr(y,t) 


(mm) are the mean values of MAP during the baseline time period for the target location x and the 
reference location y in the t month, respectively. 


3.2 Physical scaling (SP) method 


To overcome the limitation of location-centric characteristics and model geophysical changes 
within a statistical downscaling framework, Gaur and Simonovic (2017a, b) proposed the SP 
model. The model was based on the hypothesis that the large-scale climate and elevation properties 
of a location can define the local-scale climate characteristics. The SP model builds a regression 
relationship between local-scale observed data and bi-linearly interpolated climate model and 
elevation data. A generalized additive model (GAM) (Schimek and Turlach, 2000) was chosen as 
the regression function to link predictant (local-scale observed data) and predictor variables 
(bi-linearly interpolated climate model and elevation data). In this study, the SP method was 
extended to climate reconstruction and was mathematically expressed as: 


g(MAT,,.) = By + (MAT) + f(E,), (3) 
g(MAP,,,) = By + f,(MAP noa) + (Ep) » (4) 


where variables g and f represent the link and smoothing functions, respectively; MATops (°C) and 
MAPovs (mm) represents the observed MAT and MAP data, respectively; subscript obs and mod 
denote if the climatic data are observed or model based, respectively; Bo denotes the regression 
parameter; E, denotes the elevation (m) of the referred pixel, and the subscript p means that the 
data used are at a pixel scale; and MAT moa and MAPmoa denote large-scale MAT and MAP data 
obtained by a bi-linear interpolation of GCM or CRU data at the referred pixel, respectively. In this 
study, smoothing functions were fitted using the penalized likelihood maximization algorithm, 
which was a variant of the maximum likelihood estimation algorithm that was obtained by 
incorporating a penalty function, with a tradeoff between model fit wiggliness and goodness of fit 
(Wood, 2000). 


3.3 Artificial neural network (ANN) model 


Artificial neural networks (ANNs) have been widely used in the estimation of climatic, hydrologic, 
economic and other variables in recent decades. In this study, a three-layered feed-forward neural 
network (FFNN) was used. Three layers containing an input layer, a hidden layer and an output 
layer were included within a general framework for representing nonlinear functional mapping 
between a set of input and output variables (Nourani et al., 2009). The weighted connections feed 
activations in an FFNN only in the forward direction from an input layer to an output layer. In 
addition, additional weighted connections were used to feed the previous activations back to the 
network in a feed-forward back propagation (BP) network (Srinivasulu and Jain, 2006). In a FANN 
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structure with a BP algorithm, it has been proven that the three-layer structure is satisfactory for 
forecasting and simulating water resources, hydrologic parameters and climate change (ASCE Task 
Committee on Application of Artificial Neural Networks in Hydrology, 2000). The 
Levenberg-Marguardt (LM) fitting algorithm can be used to train a three-layered BP FFNN 
(TLBP-FFNN) as follows: (1) suppose a set of n empirical pairs of independent and dependent 
variables are defined as (xi, yi); and (2) in the model curve f(xi, £), the parameter f is optimized so 
that the sum of the squares of the deviations (S(£); Eq. 5) obtains a minimum value (Levenberg, 
1944; Marguardt, 1963). 


S(B)= >, [ly - f(x, BP. (5) 


A heuristic procedure is used to determine the input dimension and the number of hidden nodes 
by testing different inputs, with an increasing number of hidden nodes (Cannas et al., 2006). The 
output value of the TLBP-FFNN could be specifically expressed as follows: 


Yk = A DXVA x fi Wim tw.) + Wol» (6) 


where yx represents the k" computed output variable; m and n are the number of neurons in the 
input and hidden layers, respectively (i=1, 2, 3, ..., m and j=1, 2, 3, ..., m; fo and fh are the 
activation functions for the output and hidden neurons, respectively; wi is the k™ neuron in the 
output layer, with the j neuron in the hidden layer; wj; is a weighting linking the j® neuron in the 
hidden layer with the i! neuron in the input layer; x;stands for the i input variable for the input 
layer; and wjo and wo are the biases for the j hidden neuron and the k" neuron in the output layer, 
respectively. 

In this study, MAT and MAP were modelled by the TLBP-FFNN method, with the linear 
activation function in the output neuron and Tan-sigmoid activation function in the hidden neuron 
trained using the LM fitting algorithm. For the ANN, which is a data-driven statistical model, the 
input variables were selected from the available data, and the model was subseguently developed. 
The MAT and MAP during 1961—1999 were available from the 53 meteorological stations. 

The LM back propagation scheme was used as the training algorithm in the Matlab neural 
network toolbox (Beale et al., 2015) for the purpose of this study. The network architecture and 
training iteration number (epoch) are important in the TLBP-FFNN modeling. In this study, 10,000 
epochs resulted in a satisfactory training network, with 10 * as the target performance. 


3.4 Performance and uncertainty analysis 


3.4.1 Traditional performance assessment indices 
The following performance indices were used to assess the reconstruction performance: 


n "\2 
NSC =1- Zia = , (7) 
2.0 =y) 
" (yy — y,)x100% 
PBLAS = Þa 32 9 N (8) 
iði 


where NSC and PBIAS are the Nash-Sutcliffe coefficient (Nash and Sutcliffe, 1970) and 
percentage bias, respectively; and n, yi, y, and y are the number of observations, the observed 


value, the predicted or reconstructed value and the mean value of the observed data, respectively. 
Both NSC and PBIAS were used to assess the performance of models. When the value of NSC (—co 
to 1.00) is close to 1.00 and the absolute value of PBIAS is close to O, the estimated or 
reconstructed data series is close to the observed data series (Gupta et al., 1999; Moriasi et al., 
2007). A positive PBIAS indicates that the model is underestimating the predicted values, while a 
negative value means that the model is overestimating the predicted values (Gupta et al., 1999). 
Values of NSC greater than 0.50 and absolute values of PBIAS less than 25% indicate a 
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satisfactory model performance (Table 1). 


Table 1 Climate reconstruction performance criteria (revised from Moriasi et al. (2007)) 


Performance rating Grade NSC PBLAS 
Very good A NSC>0.75 PBIAS< +15% 
Good B 0.65<NSC<0.75 PBIAS< +20% 
Satisfactory C 0.50<NSC<0.65 PBIAS< +25% 
Unsatisfactory D NSC<0.50 Any value 


Note: NSC, Nash-Sutcliffe coefficient; PBIAS, percentage bias. 


3.4.2 Set pair analysis (SPA) theory 

The SPA theory was proposed by Zhao (1989), which provides a way to assess uncertainties. This 
theory overlaps with many other uncertainty theories, especially Boolean reasoning methods, 
evidence theory, fuzzy set theory and rough set theory (Yang et al., 2008). Emphasizing the 
relativity and fuzziness in information processing, the SPA theory can identify the relative certainty 
and relative uncertainty, and describe them quantitatively and systematically using identity, 
discrepancy and contrary (Zhao and Xuan, 1996; Ye and Wu, 2000; Wang and Li, 2012; Yan et al., 
2017). The SPA theory has been widely applied in many fields, including artificial intelligence, 
system engineering, multi-attribute assessments, urban ecosystem health assessments, energy 
consumption forecasting, water resource evaluation, weather forecasting and hazard assessment 
(Wang et al., 2004; Zhou et al., 2007; Wang et al., 2008; Su et al., 2009; Wang and Li, 2012; Wang 
et al., 2014; Yan et al., 2017). 

The certainty and uncertainty in a given system can be evaluated by the SPA theory. Assuming A 
and B are the sets given, the set pair H=(A, B), made up of A and B, is determined by its 
characteristics. The value of its characteristics is represented by N. S, F and P represent the values 
of the identity, discrepancy and contrary characteristics, respectively. The values of S, F, P and N 
meet the condition N=S+F+P. Therefore, the values of S/N, F/N and P/N are termed the identity, 
discrepancy and contrary degrees, respectively. In this study, we set a=S/N, b=F/N and c=P/N. 
Obviously, a+b+c=1 is undisputed (Wang and Li, 2012; Yan et al., 2017). The connection degree of 
H is then defined as follows: 


S F. P, ei at 
L(A, B) N N N a+bi+cj, (9) 
where „ is the connection degree of H; i is the uncertainty coefficient of the discrepancy degree, 
with an uncertainty value between —1 and 1, which corresponds to various circumstances or can be 
considered only as a measure of discrepancy; and j is the coefficient of the contrary degree and is 
specified as —1 (Su et al., 2009; Wang and Li, 2012; Yan et al., 2017). 
Equation 9 is considered to be the three-element connection degree (Wang and Li, 2012; Yan et 
al., 2017). The multi-element connection degree is defined as follows: 


MA, B)=a+ bi +b,i, +---+b, ip tc), (10) 
where a+b1+b2+...+bx2+c=1; bi, bz, ..., br are components of the discrepancy degree, explaining 
different grades of discrepancy; and ij, i2, ..., ix-2 are the uncertainty component coefficients of the 


discrepancy degree (Wang and Li, 2012). The connection degree „(A, B) may overcome the 
drawbacks of using a single index, such as the correlation coefficient, subordinate degree or grey 
correlation degree (Wang and Li, 2012; Yan et al., 2017). 

3.4.3 Revised set pair analysis (RSPA) for climate reconstruction 

Based on the principle and operating procedure of the SPA, a revised SPA, namely the RSPA, was 
proposed to assess the uncertainty of short-term climate reconstruction. We assumed that A=(y,, 
Y2, ...; Ym) was the reconstructed climatic time series, and B=(x1, x2, ..., Xm) was the observed 
climatic time series (m is the number of observed values) in this study. For the performance 
assessment of k (k>2) grades of climate reconstruction, there are k-1 thresholds (si, S2, ..., Sk-1) 
whose intervals (0—s1, s1—52, ..., Sk-2—Sk-1, Sk-1-+°°) correspond to different grades. For example, 
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k=3 means three grades, namely good, normal and bad performance (Wang and Li, 2012; Yan et al., 
2017). 
The k-element connection degree „(A, B) is described as follows: 


1+ Oi, + Oi, +---+0i,_, +0j, when A, e|0,s, | 
2) + Oi, +--+ 0i, #0), when A, € [0,5,] 
m m 
á 5 45 4200406, , +0), when A, € [0,5] 
H(A,B)=41 M M ; (11) 
S&S S, i, S, i, vu Ska k Oj, when A, € [0, Spa] 
m m m 
A jj 44 HA] +S j, when A, e[0,+0) 
m m m m m 
where A, =|y,-x,|, /=1, 2, 3, ..., m, and s1<s2<...<sw1. 
5, = 3)" sen(A, - 5); sEn(A, =s) =? "458 (12) 
= sgn(A, — S, ); sgn(A, — 5,) = E 
: 1 SBMA, = SSB i 0, else 
m 1, S4 <A, SS 
Sa =$," sgn(A, - s4); d =2, 3, 4, ..., k—1; sa-s) ii k (13) 
S=m-S-$,--.-S,,. (14) 


In this study, the absolute error (A,) between the calibrated (or verified) and observed climatic 
time series was chosen as the assessment factor. S1/m is the possibility of A, belonging to Gi grade; 
S2/m is the possibility of A, belonging to G2 grade; S3/m is the possibility of A, belonging to G3 
grade; Sk 1/m is the possibility of A, belonging to the (k-1)* grade; and S;/m is the possibility of A, 
belonging to the Gx grade. For k=5, it means that there are five grades, i.e., very good performance 
(G1), good performance (G2), normal performance (G3), bad performance (G4) and very bad 
performance (Gs). The grade standards for MAT and MAP reconstruction are shown in Table 2. 
According to the SPA theory, S;/m corresponds to the identity degree; Ss/m corresponds to the 
contrary degree; and S2/m, S3/m and S4/m are three detailed discrepancy degrees. 


Table 2 Assessment of grade standards for the calibration and verification of monthly average temperature (MAT) 
and monthly accumulated precipitation (MAP) time series in the Tianshan Mountains region (TMR) of China 


Grade 
Os, S1-S2 S2—-S3 S3-Sa S4-Ss 
Range of MAT (°C) 0.0—0.5 0.5—1.0 1.0-1.5 1.5-2.0 2.0—+0 
Range of MAP (mm) 0.0—2.0 2.0—4.0 4.0—6.0 6.0—8.0 8.0—+0 
Performance Very good Good Normal Bad Very bad 
Degree G1 G2 G3 G4 G5 


In addition, we set the ratio of R:R/ representing different calibrated (or verified) climatic time 
series for one target station based on interpolated grid data from the CMIP5 and CRU datasets 
using four different reconstruction methods: delta, ANN, SP and No methods. Here, i means 
different data sources (GCMs from the CMIP5 and CRU) and j means different reconstruction 
methods (delta, ANN, SP and No methods). For example, Rċmcm, represents a calibrated (or 
verified) climatic time series for one target station based on interpolated grid data from the 
CanCM4 GCM model using the ANN method. In this study, we set T representing the observed 
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MAT time series for one target station in the TMR, and P representing the observed MAP time 
series in the TMR. The connection degree (R, T) and „{R, P)) between calibrated (verified) and 
observed data can be calculated using Equations 11-14. 


4 Results 


In this section, we assessed the performance of calibrated and verified MAT and MAP time series, 
taking Zhaosu station as an example. 


4.1 Performance assessment of calibrated and verified MAT time series 


Figure 3 shows the calibration and verification performance of MAT time series by the delta, SP, 
ANN and No methods based on the CMIP5 and CRU datasets at Zhaosu station. The results 
indicated that 128 satisfactory calibrated and verified MAT time series were produced from a total 
of 168 time series, with a satisfaction degree of 76%. All 128 satisfactory calibrated and verified 
MAT time series captured the intra-annual variation pattern (Fig. 3). Numerically, the calibration 
and verification based on two sources of data performed very well (most performance grades were 
A (very good)) for calibrated and verified MAT time series). Use of the three methods (delta, SP 
and ANN) greatly improved the calibration and verification performance. There was no large 
difference between calibrated (verified) and observed MAT values, but there was a small difference 
when the different sources of data were used (Fig. 3). Calibrated and verified results based on grid 
data from the CRU were better than those generated using the CMIP5. This may be because the 
CRU data were superior to the CMIP5 data. The CRU data were obtained based on actual 
pointwise observations, while the CMIP5 data were modelled based on greenhouse gases, 
anthropogenic aerosol solar variability and volcanic eruptions. 

The same data sources also generated similar MAT time series using the different methods at 
Zhaosu station during the calibration and verification periods. There was almost no difference 
between calibrated (verified) and observed MAT time series based on the CRU data when using the 
four different methods (Fig. 3). Based on the CMIP5 dataset, we obtained a similar calibration and 
verification performance by the four methods according to the average NSC value. However, 
according to the PBIAS values, MAT values were underestimated or overestimated by the SP and 
ANN methods for both data sources during the calibration and verification periods. 

The numbers of satisfactory calibrated and verified MAT time series based on the CMIP5 and 
CRU datasets using the delta, SP and ANN methods in the TMR are presented in Figure 4 and 
Table S3 (see the Appendix). It can be seen that for all 53 meteorological stations selected in this 
study, satisfactory calibrated and verified MAT time series were obtained. The performance ratings 
were high, with most of them graded A (very good). Both the CMIP5 and CRU datasets performed 
very well during the MAT calibration and verification periods. The delta and SP methods produced 
satisfactory results based on both the CMIP5 and CRU datasets for all 53 meteorological stations. 
The ANN method produced a satisfactory calibrated and verified MAT time series for only 3—4 
stations based on several GCMs from the CMIP5 and CRU datasets. Some bi-linear interpolated 
CMIP5 models and CRU data could generate satisfactory calibrated and verified MAT time series 
for most stations without the use of a reconstruction method. 


4.2 Performance assessment of calibrated and verified MAP time series 


The calibration and verification performance of MAP time series produced by the delta, SP, ANN 
and No methods using the CMIP5 and CRU datasets at Zhaosu station is shown in Figure 5. In 
comparison with the calibration and verification results of MAT time series, the calibration and 
verification performance of MAP time series was inferior. Specifically, only 26 satisfactory 
calibrated and verified MAP time series among the total of 172 time series, with a satisfaction 
degree of 15%. Only 14 GCMs from the CMIP5 dataset produced a good or satisfactory calibrated 
and verified MAP time series (most performance grades were C (satisfactory)) at Zhaosu station, 
while the use of the CRU dataset did not result in any satisfactory series. The different GCMs from 
the CMIP5 dataset generated different performances, each with small differences (Fig. 5). 
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The same data source produced different results when the different methods were used at Zhaosu 
station during the calibration and verification periods. Among the 26 satisfactory calibrated and 
verified MAP time series, nine were generated by the delta method, nine by the SP method, six by 
the ANN method and two by the No method. Although all 26 satisfactory calibrated and verified 
MAP time series could capture the intra-annual pattern, there were numerical differences between 
them and the observed time series (Fig. 5). The average NSC value for calibrated and verified MAP 
time series showed a decreasing trend with the order of delta, SP, ANN and No methods. Therefore, 
for the calibrated and verified MAP time series at Zhaosu station, the delta method performed 
better than the SP method, and the SP method performed better than the ANN method. 
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Fig. 3 The 128 satisfactory calibrated, verified and observed monthly average temperature (MAT) time series at 
Zhaosu station. Obs, observed data; SP, physical scaling; ANN, artificial neural network. Delta. ACCESS1-0, the 
reconstructed data based on ACCESS1-0 data using the delta method; the others all follow this nomenclature 
scheme. 
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Fig. 4 Distribution of numbers of satisfactory calibrated and verified MAT time series in the TMR. The value at 
each station represents the number of satisfactory calibrated and verified MAT time series based on the CMIP5 and 
CRU datasets using the delta, SP and ANN methods. 
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Fig.5 The 26 satisfactory calibrated, verified and observed monthly accumulated precipitation (MAP) time series 
at Zhaosu station. Obs, the observed data; SP, physical scaling. DeltaaACCESS1.0, the reconstructed data based on 
ACCESS1.0 data using the delta method; the others all follow this nomenclature scheme. 


The number of satisfactory calibrated and verified MAP time series based on the CMIP5 and 
CRU datasets using the delta, SP, ANN and No methods in the TMR are shown in Figure 6 and 
Table S4 (see the Appendix). Satisfactory calibrated and verified MAP time series were obtained 
for only 20 stations in the TMR. The performance ratings were low and most of them were graded 
C (satisfactory). Although the number of satisfactory calibrated and verified MAP time series was 
very small (1, 2 or 3) at most of the 20 stations, it was slightly larger at Zhaosu station (26), 
followed by Bayanbulak (13), Daxigou (8) and Xiaoquzi (7) stations. With the exception of 
Zhaosu, Bayanbulak, Daxigou and Xiaoquzi stations, satisfactory MAP time series were only 
obtained with the CRU dataset. The delta and SP methods produced more satisfactory MAP time 
series than the ANN method. The ANN method produced satisfactory calibrated and verified MAP 
time series for 12 stations based on data from several GCMs in the CMIP5 and CRU datasets. The 
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CRU data produced satisfactory calibrated and verified MAP time series for three stations, while 
two GCMs from the CMIP5 dataset produced satisfactory calibrated and verified MAP time series 
for one station by the No method. 


75*E 80°E 85°E 90°E 95°E 

1 l 
7 E 
a Ä af p ka + 

a | YY 
aa! A j3 X 4! 
26 = a im 
l a! 
A 3 
A 
0 250 500 km 
Zz 1 N N J £ 
3 A3 Legend + 
A Station with satisfactory calibrated and verified MAP time series 


T T T 
75°E 80°E 85°E 90°E 9S°E 


Fig. 6 Distribution of numbers of satisfactory calibrated and verified MAP time series in the TMR. The value at 
each station represents the number of satisfactory calibrated and verified MAT time series based on the CMIP5 and 
CRU datasets using the delta, SP and ANN methods. 


The above results indicate that the delta and SP methods performed very well during the 
calibration and verification periods of MAT time series for both the CMIP5 and CRU datasets at 
each station in the TMR. The ANN method provided a satisfactory MAT time series for several 
stations. For 40% of the selected stations in the TMR, good or satisfactory MAP time series were 
obtained by the delta and SP methods using several GCMs from the CMIP5 dataset. The ANN 
method generated satisfactory MAP time series at fewer stations than the delta and SP methods 
based on several GCMs from the CMIP5 and CRU datasets. One possible explanation for the 
difference was that the temperature was impacted mainly by the radiation balance and was easier to 
simulate. However, precipitation may be influenced by many factors, such as local climate, 
topography and vapor sources and was much more difficult to model. 


5 Discussion 


In the previous section, the best results were obtained for calibrated and verified MAT and MAP 
time series at Zhaosu station. Selecting Zhaosu station as an example again, this section presents 
the uncertainty determined during the calibration and verification of MAT and MAP time series 
using the RSPA method. Connection degree expressions between calibrated (verified) and observed 
climatic (MAT and MAP) time series by the delta, SP, ANN and No method based on the CMIP5 
and CRU datasets at Zhaosu station in the TMR are provided in Tables S5 and S6 (see the 
Appendix). It should be noted that those two tables only show the first 20 entries by descending 
value of identity degree (a). 


5.1 Uncertainty analysis of calibrated and verified MAT 


For the set pair (Réxt, T), the identity degree (a) was 0.58 and 0.59 for the calibrated and verified 
MAT, the uncertainty component coefficients of the discrepancy degree (bi, b2 and bs) were 0.24, 
0.11 and 0.04 for the calibrated MAT, and 0.26, 0.06 and 0.06 for the verified MAT, respectively; 
and the contrary degree (c) was 0.03 and 0.03 for the calibrated and verified MAT, respectively 
(Table S5). Therefore, (Rör, T)=0.58+0.24i,+0.11i2+0.04i3+0.03j during the calibration period, 
which means that the connection degree between calibrated and observed MAT time series for 
Zhaosu station based on the CRU dataset using the delta method was expressed as: /MR, 
T)=0.58+0.24i,+0.1112+0.04i3+0.03j, i.e., 58%, 24%, 11%, 4% and 3% of 348 values of A, during 
the calibration period fell into the ranges of 0.0°C—0.5°C, 0.5°C-—1.0°C, 1.0°C-—1.5°C, 1.5°C-—2.0°C 
and 2.0*C—+oo, respectively. The rest of the results can be explained in the same manner. 
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According to the criteria in Table 2, the calibrated and verified MAT time series based on the 
CRU dataset obtained by the delta method produced a very good performance. There was a high 
possibility (51%—59%) of a very good performance during both the calibration and verification 
periods. For the time series based on the GCM data from the CMIP5 dataset, there was a lower 
possibility (20%—26%) of a very good performance compared with the possibility (30%-39%) of a 
very bad performance. The uncertainty component coefficients of the discrepancy degree based on 
both datasets decreased gradually from bi to bs. Almost all values of bi were larger than values of 
bz and bs. This indicates that the possibility shifted from the discrepancy to the identity was larger 
than that from the discrepancy to the contradistinction. Values of (Ri, T), “(Renu T) and KR, 
T) are positive if we let i1=0.5, i2=0.0, i3= —0.5 and j= —1.0. A larger positive value means that the 
calibrated (verified) and observed time series are more similar and the uncertainty is lower. 
Therefore, the lowest uncertainty for the reconstruction of MAT was based on the CRU data using 
the three methods (delta, SP and ANN). 

Different methods based on the same data source can generate different results for the MAT. The 
CRU dataset showed that „(Réxs, T)>u(Röru, T)>u(Reru , T) and the CMIP5 dataset showed that „(R 
xxx, 1)>U(Ryxx, T)>u(Rsxx, T)>u(Ryxx, T) (here XXX means any dataset from CMIP5) on average if 
i1=0.5, i,=0.0, i3= -0.5 and j= —1.0. In summary, the SP method produced 11 entries, and ANN and 
delta methods produced seven and two entries, respectively, among the first 20 entries. 

The numbers of positive connection degrees for calibrated and verified MAT time series based 
on the CMIP5 and CRU datasets using the delta, SP and ANN methods in the TMR are shown in 
Table S7 (see the Appendix). It can be seen that 51 stations can obtain a positive connection degree 
if i1=0.5, iz=0.0, i3= —0.5 and j= —1.0. The CRU dataset produced low levels of uncertainty for 
many more stations than the CMIP5 dataset. The delta and SP methods produced low levels of 
uncertainty based on the CRU dataset and several GCMs from the CMIP5 dataset for all 51 
stations. The ANN method could not produce good calibration and verification results for MAT. 

Therefore, using the CRU dataset, we can obtain the best performance, with the lowest 
uncertainty for MAT; while we can acquire a good performance with higher levels of uncertainty 
for MAT, based on most GCMs from the CMIP5 dataset. The delta and SP methods were more 
robust in the calibration and verification of MAT time series. 


5.2 Uncertainty analysis of calibrated and verified MAP 


For the set pair (Rèu, P), the identity degree (a) was 0.22 and 0.23 for the calibrated and verified 
MAP; the uncertainty component coefficients of the discrepancy degree (bi, b2 and bs) were 0.16, 
0.10 and 0.06 for the calibrated MAP, and 0.10, 0.07 and 0.08 for the verified MAP, respectively; 
and the contrary degree was 0.46 and 0.43 for the calibrated and verified MAP, respectively. 
Therefore, (Rew, P)=0.22+0.16i1+0.10i2+0.06i3+0.46j during the calibration period, which means 
that the connection degree between calibrated and observed MAP time series for Zhaosu station 
based on the CRU dataset using the delta method was expressed as: „ÁR, 
P)=0.22+0.16i1+0.10i2+0.06i3+0.46j, i.e., 22%, 16%, 10%, 6% and 46% of 348 values of A, during 
the calibration period fell into the ranges of 0-2, 2—4, 4-6, 6-8, and 8 mm—+o, respectively (Table 
S6, see the Appendix). The rest of the results can be explained in the same manner. 

According to the criteria in Table 2, calibrated and verified MAP time series based on the CRU 
dataset by the delta method produced a very good performance. There was a 22%-23% possibility 
of a very good performance during both the calibration and verification periods. For some GCMs 
from the CMIP5 dataset, the possibility of providing a very good performance (10%-18%) was 
much lower than that of making a very bad performance (49%-67%). As for MAP, the uncertainty 
component coefficients of the discrepancy degree for MAP based on both datasets decreased 
gradually from bı to ba. Almost all values of bı were larger than the values of bz and bs. This 
indicates that the possibility shifted from the discrepancy to the identity was larger than that from 
the discrepancy to the contradistinction. All 20 connection degree values were negative, with „(R 


delta 


cru, P) having the largest value, if we set i1=0.5, i2=0.0, i3= —0.5 and j= —1.0. Therefore, using the 
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CRU dataset by the delta method, we can obtain the best performance, with the lowest uncertainty. 

Different methods based on the same data source can also generate different results for the MAP. 
The CRU dataset showed that (Rex, P)>M(Röru, P) if i1=0.5, i2=0.0, i= —0.5 and j= —1.0, while the 
CMIP5 dataset showed that „(Rxxx, P)>u(Rixx, P)>u(Rxxx, P)>u(Rxxx, P) (here XXX means any 
dataset from CMIP5) on average if i1=0.5, i2=0.0, i3= —0.5 and j= —1.0. In summary, the delta 
method produced 16 entries, and the ANN and No method produced one and three entries, 
respectively. 

The number of positive connection degrees for calibrated and verified MAP time series based on 
the CMIP5 and CRU datasets using the delta, SP and ANN methods in the TMR are shown in Table 
S8 (see the Appendix). From the results it can be seen that 33 of the stations selected in this study 
obtained a positive connection degree if i1=0.5, i2=0.0, i3= —0.5 and j= —1.0. The CRU dataset 
produced low levels of uncertainty for more stations than the CMIP5 dataset. The delta, SP and 
ANN methods produced low levels of uncertainty based on both the CMIP5 and CRU datasets for 
all 33 stations. However, the delta method performed slightly better than the SP method while the 
SP method was better than the ANN method. For some stations, the use of the CMIP5 GCMs and 
CRU dataset could produce calibrated and verified MAP time series with low levels of uncertainty 
by the No method. 

The best performance, with the lowest level of uncertainty, was obtained for MAP using the 
CRU dataset, while the performance based on model data from the CMIP5 dataset was not very 
good, with higher levels of uncertainty. The Zhaosu station is not shown in Table S8, which 
indicates that the satisfactory calibrated and verified MAT or MAP time series, as determined by 
traditional performance indices, did not always result in low levels of uncertainty for a given range 
of absolute error. 


5.3 Uncertainty comparison between reconstructed MAT and MAP 


Although this study used the latest grid data from the CRU and CMIP5 datasets and a range of 
reconstruction methods, the reconstruction of MAT was obviously superior to that of MAP. This 
was a similar outcome to that of a previous study by Li et al. (2013). The reconstruction of MAT 
based on the CRU and CMIP5 datasets and using the three reconstruction methods could capture 
not only the pattern of intra-annual changes but also the actual values, especially the extreme 
values, with low levels of uncertainty. For the reconstruction of MAP, the intra-annual pattern of 
variation could be captured, but the detailed description was poor in terms of actual values. It could 
not reconstruct the extreme monthly precipitation in one year. Large levels of uncertainty existed 
during the reconstruction of MAP based on the CRU and CMIP5 datasets using the three 
reconstruction methods. 

The temperature was impacted mainly by the radiation balance, which was apparent from the 
CRU dataset and GCMs in the CMIP5 dataset. In contrast, precipitation was influenced by many 
factors such as local climate, topography and vapor sources and was difficult to model using the 
GCMs in the CMIP5 dataset compared to the CRU dataset. Therefore, the reconstructed time series 
of MAT could be used for the long-term (from 1850 or 1901 to now) assessment of regional 
climate change in the TMR. 


6 Conclusions 


By collecting available observed data from meteorological stations in the TMR and grid data from 
the CRU and the CMIP5 datasets, this study calibrated and verified MAT and MAP in the TMR 
mainly using the delta, SP and ANN methods. The reconstruction performance and uncertainty 
from different data sources and methods were assessed and compared using traditional performance 
assessment indices and the RSPA method. The delta and SP methods resulted in very good and 
similar performances for calibrated and verified MAT time series in the TMR based on the CRU 
and CMIP5 datasets, according to performance indices. The performance based on the grid data 
from the CRU dataset was better than that based on the data from the CMIP5 dataset. The 
calibration and verification of MAP did not indicate a very good performance compared to the 
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results of MAT. The ANN method was more suitable for the reconstruction of MAP. However, for 
a satisfactory calibrated and verified MAT or MAP time series, an assessment by performance 
indices did not always indicate a low level of uncertainty at a given range of absolute error 
according to the RSPA method. Most satisfactory calibrated and verified MAT time series and 
several MAP time series captured the characteristics of their annual periodicity, distribution or 
pattern well for the targeted stations. The accuracy of values could not be guaranteed for a given 
range of absolute error. The RSPA method described the numerical accuracy in more detail 
compared to traditional indices. It is recommended to combine the traditional performance indices 
with the RSPA method in gualitative and guantitative assessments of climate reconstruction. 

This study was a new attempt to calibrate and verify climatic variables (MAT and MAP) using 
the SP method. The use of the SP method produced a comparable performance to the delta method. 
Therefore, it can be applied to simulate or reconstruct climatic variables due to its easy 
implementation. In future work, the data and methods used in this study that produced satisfactory 
time series, with low levels of uncertainty (for MAT), could be applied to the period of 1901—1960 
based on the CRU dataset and the period of 1850-1960 based on the CMIP5 dataset for the 
assessment of regional climate change in the TMR at the century scale. 
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Appendix 
Table S1 Descriptive information of the meteorological stations selected in the Tianshan Mountains region (TMR) 
No. Station name Station code Latitude Longitude Elevation (m) Time period 
1 Akgi AKO 78.45°N 40.93°E 1985.700 1961-201 
2 Aksu AKS 80.23°N 41.17°E 1104.730 1961-201 
3 Baicheng BCH 81.90°N 41.78°E 1230.000 1961-201 
4 Balguntay BLGT 86.30°N 42.73*E 1739.470 1961—201 
5 Barkol BKL 93.05°N 43.60°E 1675.370 1961-201 
6 Bayanbulak BYBLK 84.15°N 43.03°E 2459.270 1961-201 
7 Bole BL 82.07°N 44,90°E 532.567 1961-201 
8 Caijiahu CJH 87.53°N 44,20°E 440.967 1961-201 
9 Dabancheng DBCH 88.32°N 43.35°E 1104.770 1961-201 
10 Daxigou DXG 86.83°N 43.10°E 3539.000 1961-201 
11 Gongliu GL 82.23°N 43.47°E 775.970 1961-201 
12 Hami HM 93.52°N 42.82°E 738.200 1961-201 
13 Hejing HJ 86.40°N 42.32°E 1102.370 1961-201 
14 Hoxud HX 86.80°N 42.25°E 1086.130 1961-201 
15 Hutubi HTB 86.82°N 44.13*E 522.970 1961—201 
16 Jiashi JSH 76.73*N 39.50°E 1208.600 1961-201 
17 Jimsar JMS 89.17°N 44.02°E 735.570 1961-201 
18 Jinghe JH 82.90°N 44.60°E 320.870 1961-201 
19 Kalpin KP 79.05°N 40.50°E 1162.630 1961-201 
20 Kashi KSH 75.99°N 39.27*E 1291.200 1961—201 
21 Khorgas KG 80.42°N 44,20°E 772.970 1961-201 
22 Korla KL 86.13°N 41.25°E 932.433 1961-201 
23 Kumux KM 88.22°N 42.23*E 923.470 1961—201 
24 Kuga KO 82.97°N 41.73°E 1082.930 1961-201 
25 Manas MNS 86.20°N 44,32°E 471.400 1961-201 
26 Mori MR 90.28°N 43.83°E 1271.970 1961-201 
27 Mosuowan MSW 86.10°N 45.02°E 347.230 1961-201 
28 Naomaohu NMH 95.13°N 43.77°E 479.030 1961-201 
29 Nilka NLK 82.62°N 43.80°E 1106.130 1961-201 
30 Paotai PT 85.25°N 44.85%E 337.100 1961—201 
31 Oapgal QPQ 81.15°N 43.83°E 603.800 1961-201 
32 Qitai QT 89.57°N 44,02°E 794.100 1961-201 
33 Shanshan SHSH 90.23°N 42.85°E 398.930 1961-201 
34 Shawan SHW 85.62°N 44.33°E 522.200 1961-201 
35 Shihezi SHHZ 86.05°N 44.32°E 443.730 1961-201 
36 Shisanjianfang SHSJF 91.73°N 43.22°E 722.900 1961-201 
37 Tekes TKS 81.77°N 43.18°E 1210.530 1961-201 
38 Tianchi TCH 88.12°N 43.88°E 1942.170 1961-201 
39 Toksun TKS 88.63°N 42.80°E 1.670 1961-201 
40 Tuergate TEGT 75.40°N 40.52°E 3506.400 1961-201 
41 Turpan TP 89.20°N 42.93°E 34.970 1961-201 
42 Urumgi URMO 87.65°N 43.78°E 935.670 1961-201 
43 Usu US 84.67°N 44.43*E 478.970 1961—201 
44 Wenguan WEQ 81.02°N 44.97*E 1358.600 1961—201 
45 Wugia WUO 75.25°N 39.72°E 2176.900 1961-201 
46 Wushi WSH 79.23°N 41.22°E 1396.470 1961-201 
47 Xiaoguzi XQZ 87.10°N 43.49°E 1873.800 1961-201 
48 Xinhe XH 82.61°N 41.53°E 1014.200 1961—201 
49 Xinyuan XY 83.30°N 43.45°E 929.200 1961-201 
50 Yangi YO 86.57°N 42.08°E 1056.600 1961-201 
51 Yining YN 81.33*N 43.95°E 663.200 1961-201 
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52 Yiwu YW 94.70°N 43.27°E 1728.600 1961-2011 
53 Zhaosu ZHS 81.13°N 43.15°E 1853.400 1961-2011 
Table S2 Basic information of the CMIP5 (Coupled Model Inter-comparison Project phase 5) models 
No. Model name Source and country Resolution Historical period 
1 ACCESS1-0 CSIRO-BOM, Australia 192x145 Jan 1850-Dec 2005 
2 ACCESS1-3 CSIRO-BOM, Australia 192x145 Jan 1850-Dec 2005 
3 BCC-CSM1-1 BCC, China 128x64 Jan 1850-Dec 2012 
4 BCC-CSM1-1-M BCC, China 320x160 Jan 1850-Dec 2012 
5 BNU-ESM GCESS, China 128x64 Jan 1850-Dec 2005 
6 CanCM4 CCCMA, Canada 128x64 Jan 1850-Dec 2005 
7 CanESM2 CCCMA, Canada 128x64 Jan 1850-Dec 2005 
8 CCSM4 NCAR, U.S. 288x192 Jan 1850-Dec 2005 
9 CESM1-BGC NSF-DOE-NCAR, U.S. 288x192 Jan 1850-Dec 2005 
10 CESM1-CAM5 NSF-DOE-NCAR, U.S. 288x192 Jan 1850-Dec 2005 
11 CESM1-FASTCHEM NSF-DOE-NCAR, U.S. 288x192 Jan 1850-Dec 2005 
12 CESM1-WACCM NSF-DOE-NCAR, U.S. 144x96 Jan 1850-Dec 2005 
13 CMCC-CESM CMCC, Italy 96x48 Jan 1850-Dec 2005 
14 CMCC-CM CMCC, Italy 480x240 Jan 1850-Dec 2005 
15 CMCC-CMS CMCC, Italy 192x96 Jan 1850-Dec 2005 
16 CNRM-CM5 CNRM-CERFACS, French 256x128 Jan 1850-Dec 2005 
17 CSIRO-Mk3-6-0 CSIRO-QCCCE, Australia 192x96 Jan 1850-Dec 2005 
18 EC-EARTH K Pa 320x160 Jan 1950-Dec 2012 
nations 
19 FGOALS-g2 FGOALS, China 128x60 Jan 1960-Jan 1999 
20 FGOALS-s2 FGOALS, China 128x108 Jan 1850-Dec 2005 
21 FIO-ESM FIO, China 128x64 Jan 1850-Dec 2005 
22 GFDL-ESM2G NOAA GFDL, U.S. 144x90 Jan 1961—Dec 2000 
23 GFDL-ESM2M NOAA GFDL, U.S. 144x90 Jan 1961—Dec 2000 
24 GISS-E2-H NASA GISS, U.S. 144x90 Jan 1951—Dec 2005 
25 GISS-E2-H-CC NASA GISS, U.S. 144x90 Jan 1951-Dec 2010 
26 GISS-E2-R NASA GISS, U.S. 144x90 Jan 1951-Dec 2005 
27 GISS-E2-R-CC NASA GISS, U.S. 144x90 Jan 1951-Dec 2005 
28 HadCM3 MOHC, England 96x73 Jan 1959-Dec 2005 
29 HadGEM2-AO NIMR/KMA, SK/England 192x145 Jan 1860—Dec 2005 
30 HadGEM2-CC MOHC, England 192x145 Dec 1959-Nov 2005 
31 HadGEM2-ES MOHC, England 192x145 Dec 1959-Nov 2005 
32 INMCM4 INM, Russia 180x120 Jan 1850-Dec 2005 
33 IPSL-CM5A-LR IPSL, French 96x96 Jan 1850—Dec 2005 
34 IPSL-CMSA-MR IPSL, French 144x143 Jan 1850-Dec 2005 
35 IPSL-CM5B-LR IPSL, French 96x96 Jan 1850-Dec 2005 
36 MIROC4h MIROC, Japan 640x320 Jan 1961—Dec 2000 
37 MIROC5 MIROC, Japan 256x128 Jan 1850-Dec 2012 
38 MIROC-ESM MIROC, Japan 128x64 Jan 1950-Dec 2005 
39 MIROC-ESM-CHEM MIROC, Japan 128x64 Jan 1850-Dec 2005 
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40 MRI-CGCM3 MRI, Japan 320x160 Jan 1850-Dec 2005 
41 NorESM1-M NCC, Norway 144x96 Jan 1850-Dec 2005 
42 NorESM1-ME NCC, Norway 144x96 Jan 1850-Dec 2005 


Table S3 Numbers of satisfactory calibrated and verified monthly average temperature (MAT) time series based 
on the CMIP5 and Climatic Research Unit (CRU) datasets using the delta, physical scaling (SP), artificial neural 
network (ANN) and No methods 


Number of satisfactory calibrated and verified MAT time series 


No. Station name 
19 pe =. e. En E CRU owe 

1 Akqi 88 87 41 42 0 5 2 86 
2 Aksu 90 85 41 42 0 7 3 87 
3 Baicheng 91 86 41 42 0 8 2 89 
4 Balguntay 102 89 41 42 0 19 2 100 
5 Barkol 69 55 41 27 0 1 2 67 
6 Bayanbulak 82 82 41 41 0 0 2 80 
7 Bole 91 84 41 42 0 8 2 89 
8 Caijiahu 92 84 41 41 0 10 2 90 
9 Dabancheng 93 87 41 42 0 10 3 90 
10 Daxigou 83 83 41 42 0 0 2 81 
11 Gongliu 90 88 41 42 0 7 2 88 
12 Hami 91 86 41 42 0 8 3 88 
13 Hejing 93 91 41 42 0 10 3 90 
14 Hoxud 93 90 41 42 0 10 3 90 
15 Hutubi 91 85 41 42 0 8 3 88 
16 Jiashi 86 83 41 42 0 3 2 84 
17 Jimsar 92 86 41 42 0 9 2 90 
18 Jinghe 102 93 41 42 0 19 2 100 
19 Kalpin 86 83 41 42 0 3 2 84 
20 Kashi 102 93 41 42 0 19 2 100 
21 Khorgas 99 89 41 42 0 16 3 96 
22 Korla 102 93 41 42 0 19 3 99 
23 Kumux 101 90 41 42 0 18 3 98 
24 Kuqa 91 85 41 42 0 8 3 88 
25 Manas 101 90 41 42 0 18 2 99 
26 Mori 100 90 41 42 0 17 3 97 
27 Mosuowan 102 90 41 42 0 19 2 100 
28 Naomaohu 85 84 41 42 0 2 3 82 
29 Nilka 91 89 41 42 0 8 2 89 
30 Paotai 101 91 41 42 0 18 2 99 
31 Oapgal 102 98 41 42 0 19 3 99 
32 Oitai 99 90 41 42 0 16 3 96 
33 Shanshan 83 83 41 42 0 0 2 81 
34 Shawan 105 99 41 42 0 22 2 103 
35 Shihezi 109 91 41 42 0 26 2 107 
36 Shisanjianfang 97 87 41 42 0 14 3 94 
37 Tekes 87 87 41 42 0 4 2 85 
38 Tianchi 82 75 41 40 0 1 1 81 
39 Toksun 85 83 41 42 0 2 3 82 
40 Tuergate 87 85 41 42 0 4 2 85 
41 Turpan 84 83 41 42 0 1 2 82 
42 Urumqi 100 89 41 42 0 17 2 98 
43 Usu 101 90 41 42 0 18 3 98 
44 Wenguan 127 94 41 41 35 10 2 125 
45 Wugia 83 83 41 42 0 0 2 81 
46 Wushi 85 83 41 41 0 3 1 84 
47 Xiaoguzi 91 79 41 41 1 8 2 89 
48 Xinhe 95 91 41 42 0 12 3 92 
49 Xinyuan 90 88 41 42 0 7 2 88 
50 Yangi 98 90 41 42 0 15 3 95 
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51 Yining 

52 Yiwu 

53 Zhaosu 
Table S4 


89 
135 
128 


86 41 42 0 6 
106 41 42 37 15 
119 41 42 42 3 


based on the CRU and CMIP5 datasets using the delta, SP, ANN and No methods 


No. 


Number of satisfactory calibrated and verified MAP time series 


Numbers of satisfactory calibrated and verified monthly accumulated precipitation (MAP) time series 
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Note: B, good; C, satisfactory. 


Table S5 Connection degree expressions between calibrated (verified) and observed MAT time series (first 20 
entries) at Zhaosu station 


No. Connection a bi bz ba 
degree Č v C V C Vv C v G Vv 
1 HR, T) 0.58 0.59 0.24 0.26 0.11 0.06 0.04 0.06 0.03 0.03 
2 MRi, T) 0.53 0.56 0.28 0.28 0.13 0.08 0.04 0.06 0.03 0.03 
3 MR", T) 0.51 0.56 0.31 0.27 0.13 0.10 0.03 0.05 0.02 0.03 
4 (Rök, T) 0.26 0.25 0.15 0.13 0.17 0.10 0.10 0.13 0.32 0.39 
5 H(Rivoaises T) 0.26 0.19 0.16 0.24 0.13 0.14 0.14 0.13 0.32 0.30 
6 H(Rexuesm, T) 0.25 0.14 0.15 0.27 0.16 0.13 0.1 0.13 0.33 0.33 
7 tlReiizca) T) 0.25 0.26 0.19 0.16 0.14 0.15 0.10 0.15 0.32 0.28 
8 H(Rétsmwacem, T) 0.25 0.23 0.16 0.14 0.14 0.16 0.1 0.14 0.34 0.33 
9 MREmosmass, T) 0.24 0.20 0.16 0.16 0.14 0.19 0.1 0.15 0.35 0.30 
10 H(Rxtrocesw-cuemy T) 0.24 0.15 0.16 0.25 0.14 0.17 0.1 0.09 0.36 0.34 
11 MREssean, T) 0.24 0.19 0.20 0.19 0.13 0.11 0.1 0.18 0.32 0.33 
12 (Rice, T) 0.24 0.23 0.22 0.20 0.14 0.15 0.1 0.13 0.30 0.28 
13 lRéisnn.eastenem, T) 0.24 0.22 0.17 0.24 0.14 0.11 0.10 0.11 0.35 0.33 
14 MRSS n:n, T) 0.24 0.23 0.19 0.18 0.14 0.15 0.12 0.14 0.32 0.30 
15 H(Retsmrastcum, T) 0.23 0.23 0.21 0.18 0.11 0.10 0.1 0.11 0.34 0.35 
16 H(Reoaisse, T) 0.23 0.23 0.15 0.19 0.17 0.17 0.10 0.12 0.35 0.30 
17 M(REssean, T) 0.23 0.17 0.18 0.24 0.13 0.18 0.12 0.08 0.34 0.33 
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18 U(RÈccon T) 0.23 0.21 0.18 0.15 0.14 0.18 0.12 0.12 0.34 0.35 
19 uR» T) 0.23 0.21 0.21 0.19 0.15 0.18 0.11 0.09 0.29 0.33 
20 H(Ristenoes, T) 0.23 0.18 0.18 0.21 0.15 0.14 0.12 0.13 0.32 0.34 


Note: a, identity degree; bı, b2 and bs, uncertainty component coefficients of the discrepancy degree; c, contrary degree; C, calibrated; V, 
verified. 


Table S6 Connection degree expressions between calibrated (verified) and observed MAP (first 20 entries) at 
Zhaosu station 


No Connection a bi by ba c 
: degree C v C Vv G V C v C v 
1 (Rís, P) 0.22 0.23 0.16 0.10 0.10 0.07 0.06 0.08 0.46 0.53 
2 H(Rinocat, P) 0.18 0.13 0.11 0.09 0.1 0.13 0.08 0.06 0.53 0.58 
3 UREE aim P) 0.16 0.13 0.12 0.09 0.10 0.13 0.06 0.08 0.57 0.56 
4 U(Rimms P) 0.16 0.12 0.14 0.13 0.09 0.11 0.07 0.05 0.54 0.60 
5 U(Riiroca P) 0.16 0.10 0.08 0.08 0.1 0.10 0.07 0.11 0.57 0.62 
6 W(Ryinocs, P) 0.15 0.12 0.13 0.13 0.10 0.08 0.08 0.08 0.55 0.59 
7 URE P) 0.15 0.17 0.14 0.13 0.1 0.06 0.07 0.07 0.53 0.58 
8 SN) 0.15 0.13 0.11 0.13 0.10 0.11 0.11 0.04 0.53 0.60 
9 U(RERomaso P) 0.15 0.13 0.13 0.13 0.07 0.08 0.09 0.10 0.56 0.57 
10 L(Riiti.cccy P) 0.15 0.18 0.12 0.13 0.1 0.10 0.07 0.07 0.55 0.53 
11 Rx. P) 0.15 0.13 0.11 0.10 0.10 0.09 0.08 0.11 0.56 0.57 
12 MRäiiccen, P) 0.15 0.14 0.11 0.08 0.1 0.10 0.09 0.10 0.54 0.58 
13 MRXtcessio, P) 0.14 0.13 0.11 0.10 0.08 0.11 0.08 0.08 0.59 0.58 
14 URSo P) 0.14 0.12 0.09 0.06 0.06 0.08 0.06 0.10 0.66 0.64 
15 H(Risacesoes, P) 0.14 0.11 0.09 0.08 0.08 0.09 0.07 0.07 0.61 0.65 
16 M(Rsceessio, P) 0.14 0.08 0.13 0.14 0.1 0.12 0.08 0.09 0.54 0.58 
17 st) 0.14 0.13 0.14 0.15 0.12 0.12 0.08 0.07 0.53 0.53 
18 MRirscusasa, P) 0.14 0.14 0.13 0.11 0.09 0.13 0.08 0.07 0.56 0.55 
19 L(Ryitioc-esms P) 0.14 0.1 0.13 0.16 0.09 0.08 0.09 0.06 0.56 0.60 
20 M(Resnss.cns, P) 0.14 0.15 0.14 0.08 0.09 0.12 0.07 0.05 0.56 0.60 


Note: a, identity degree; bı, b2 and bs, uncertainty component coefficients of the discrepancy degree; c, contrary degree; C, calibrated; V, 
verified. 


Table S7 Number of positive connection degrees for calibrated and verified MAT time series based on the CMIP5 
and CRU datasets using the delta, SP, ANN and No methods 


Station Number of positive connection degrees for calibrated and verified MAT time series 


No. Delta ANN No 
name Total Sistod SP method snethod Sietiod CRU CMIP5 
1 Akgi 4 2 2 0 0 2 2 
2 Aksu 19 8 10 0 1 3 16 
3 Baicheng 2 1 1 0 0 2 0 
4 Barkol 2 1 1 0 0 2 0 
5 Balguntay 19 1 18 0 0 2 17 
6 Bayanbulak 2 1 1 0 0 2 0 
7 Bole 2 1 1 0 0 2 0 
8 Caijiahu 2 1 1 0 0 2 0 
9 Dabancheng 3 1 1 0 1 3 0 
10 Daxigou 14 1 13 0 0 2 12 
11 Gongliu 2 1 1 0 0 2 0 
12 Hami 3 1 1 0 1 3 0 
13 Hejing 3 1 2 0 0 2 1 
14 Hoxud 2 1 1 0 0 2 0 
15 Hutubi 2 1 1 0 0 2 0 
16 Jiashi 4 1 3 0 0 2 2 
17 Jinghe 2 1 1 0 0 2 0 
18 Kalpin 3 2 1 0 0 2 1 
19 Khorgas 2 1 1 0 0 2 0 
20 Korla 10 4 6 0 0 2 8 
21 Kuga 5 3 1 0 1 3 2 
22 Kumux 4 2 2 0 0 2 2 
23 Manas 2 1 1 0 0 2 0 
24 Mori 3 1 1 0 1 3 0 
25 Mosuowan 2 1 1 0 0 2 0 
26 Naomaohu 2 1 1 0 0 2 0 
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27 Nilka 2 1 1 0 0 2 0 
28 Paotai 2 1 1 0 0 2 0 
29 Oapgal 2 1 1 0 0 2 0 
30 Oitai 3 1 1 0 1 3 0 
31 Shanshan 1 1 0 0 0 1 0 
To be continued 
Continued 
fa Station Number of positive connection degrees a calibrated ane verified MAT time series 
o. elta o 
name Total method SP method method iveihad CRU CMIP5 

32 Shawan 2 1 1 0 0 2 0 
33 Shihezi 2 1 1 0 0 2 0 
34 — Shisanjianfang 3 1 1 0 1 3 0 
35 Tekes 2 1 1 0 0 2 0 
36 Tianchi 3 1 1 1 0 3 0 
37 Toksun 2 1 1 0 0 2 0 
38 Tuergate 5 2 3 0 0 2 3 
39 Turpan 2 1 1 0 0 2 0 
40 Urumqi 2 1 1 0 0 2 0 
41 Usu 2 Í 1 0 0 2 0 
42 Wenguan 1 1 0 0 0 1 0 
43 Wugia 2 1 1 0 0 2 0 
44 Wushi 3 2 1 0 0 3 0 
45 Xiaoguzi 4 1 1 1 1 4 0 
46 Xinhe 22 10 11 0 1 3 19 
47 Xinyuan 2 1 1 0 0 2 0 
48 Yangi 15 6 8 0 a 3 12 
49 Yining 2 1 1 0 0 2 0 
50 Yiwu 3 1 2 0 0 2 1 
51 Zhaosu 3 al 1 1 0 3 0 


Table S8 Number of positive connection degrees for calibrated and verified MAP time series based on the CMIP5 
and CRU datasets using the delta, SP, ANN and No methods 


Number of positive connection degrees for calibrated and verified MAP time series 


Station 


No. Delta SP ANN No 
1808 Total method method method method CRU GMIPS 
1 Aksu 101 43 30 28 0 3 98 
2 Balguntay 48 43 0 0 2 46 
3 Bayanbulak 7 7 0 0 0 0 7 
4 Caijiahu 3 1 1 1 0 3 0 
5 Dabancheng 97 43 30 24 0 3 94 
6 Hami 137 43 41 36 17 3 134 
7 Hejing 121 43 38 34 6 4 117 
8 Hoxud 61 43 4 10 4 3 58 
9 Jiashi 110 43 34 28 5 3 107 
10 Jinghe 54 34 3 8 9 3 51 
11 Kalpin 53 43 3 4 3 4 49 
12 Kashi 127 43 43 40 1 4 123 
13 Korla 135 43 41 34 17 4 131 
14 Kuqa 105 43 29 29 4 4 101 
15 Kumux 129 43 39 37 10 4 125 
16 Manas 4 1 1 1 1 4 0 
17 Mosuowan 4 1 1 1 1 4 0 
18 Naomaohu 137 43 43 42 9 4 133 
19 Paotai 3 1 1 0 1 3 0 
20 Qitai 4 1 1 1 1 4 0 
21 Shanshan 132 43 43 43 3 3 129 
22 Shisanjianfang 131 43 43 43 2 3 128 
23 Toksun 63 0 42 21 0 1 62 
24 Tuergate 1 1 0 0 0 1 0 
25 Turpan 63 1 41 21 0 1 62 
26 Urumqi 3 1 1 1 0 3 0 
27 Usu 4 1 1 1 1 4 0 
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28 Wugia 1 1 0 0 0 1 0 
29 Wushi 10 9 1 0 0 2 8 
30 Xinhe 61 43 3 10 5 4 57 
31 Yangi 59 43 4 7 5 4 55 
32 Yining 4 1 1 1 1 4 0 
33 Yiwu 58 43 1 3 11 4 54 


